Using Partitioning to Speed Up Speci c-to-General Rule Induction
نویسنده
چکیده
RISE (Domingos 1995; in press) is a rule induction algorithm that proceeds by gradually generalizing rules, starting with one rule per example. This has several advantages compared to the more common strategy of gradually specializing initially null rules, and has been shown to lead to signi cant accuracy gains over algorithms like C4.5RULES and CN2 in a large number of application domains. However, RISE's running time (like that of other rule induction algorithms) is quadratic in the number of examples, making it unsuitable for processing very large databases. This paper studies the use of partitioning to speed up RISE, and compares it with the wellknown method of windowing. The use of partitioning in a speci c-to-general induction setting creates synergies that would not be possible with a general-to-speci c system. Partitioning often reduces running time and improves accuracy at the same time. In noisy conditions, the performance of windowing deteriorates rapidly, while that of partitioning remains stable.
منابع مشابه
Efficiency Improvement of Induction Motor using Fuzzy-Genetic Algorithm
In most industrial zones, electric energy is one of the most important energy sources. Since electrical motors are the main energy consumers of industrial factories, consumption optimization in these motors can be considered as a main option related to energy saving. One very effective way to reduce the consumption of these equipment is to use a motor speed controllers or drives. Since the loss...
متن کاملExperience-based Learning in Deductive Reasoning Systems
General knowledge is widely applicable but relatively slow to apply to any particular situation Speci c knowledge can be used rapidly where it applies but is only narrowly ap plicable We present an automatic scheme to migrate general knowledge to speci c knowledge during reasoning This scheme relies on a nested rule representation which retains the rule builder s intentions about which of the p...
متن کاملDIAGNOSIS OF BREAST LESIONS USING THE LOCAL CHAN-VESE MODEL, HIERARCHICAL FUZZY PARTITIONING AND FUZZY DECISION TREE INDUCTION
Breast cancer is one of the leading causes of death among women. Mammography remains today the best technology to detect breast cancer, early and efficiently, to distinguish between benign and malignant diseases. Several techniques in image processing and analysis have been developed to address this problem. In this paper, we propose a new solution to the problem of computer aided detection and...
متن کاملFuzzy Partitioning of Quantitative Attribute Domains by a Cluster Goodness Index Fuzzy Partitioning of Quantitative Attribute Domains by a Cluster Goodness Index
The problem of mining association rules for fuzzy quantitative items was introduced and an algorithm proposed in [7]. However, the algorithm assumes that fuzzy sets are given. In this paper we propose a method to nd the fuzzy sets for each quantitative attribute in a database by using clustering techniques. We present a scheme for nding the optimal partitioning of a data set during the clusteri...
متن کاملLearning in Deduction by Knowledge Migration and Shadowing
A method of deductive learning is developed to control deductive inference Our goal is to im prove problem solving time by experience when that experience monotonically adds knowledge to the knowledge base In particular for deductive reasoning systems where partial results are saved dur ing a derivation and at least some partial results are themselves deduction rules we suggest ways of taking m...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1996